Kharoṣṭhī |
|
---|---|
Type | Abugida |
Languages | Gandhari Prakrit Tocharian Kuchean |
Time period | 4th century BCE - 3rd century CE |
Parent systems | |
Sister systems | Brāhmī Nabataean Syriac Palmyrenean Mandaic Pahlavi Sogdian |
ISO 15924 | Khar, 305 |
Direction | Right-to-left |
Unicode alias | Kharoshthi |
Unicode range | U+10A00—U+10A5F |
Note: This page may contain IPA phonetic symbols. |
The Kharoṣṭhī script, is an ancient abugida (or "alphasyllabary") used by the Gandhara culture ancient South Asia to write the Gāndhārī and Sanskrit languages. It was in use from the middle of the 3rd century BCE until it died out in its homeland around the 3rd century CE. It was also in use in Kushan, Sogdiana (see Issyk kurgan) and along the Silk Road where there is some evidence it may have survived until the 7th century in the remote way stations of Khotan and Niya. Kharoṣṭhī is encoded in the Unicode range U+10A00—U+10A5F, from version 4.1.0.
Contents |
Kharoṣṭhī is mostly written right to left (type A), but some inscriptions (type B) already show the left to right direction that was to become universal for the later South Asian scripts.
Each syllable includes the short a sound by default, with other vowels being indicated by diacritic marks. Recent epigraphical evidence highlighted by Professor Richard Salomon of the University of Washington has shown that the order of letters in the Kharoṣṭhī script follows what has become known as the Arapacana Alphabet. As preserved in Sanskrit documents the alphabet runs:
Some variations in both the number and order of syllables occur in extant texts.
Kharoṣṭhī includes only one standalone vowel sign which is used for initial vowels in words. Other initial vowels use the a character modified by diacritics. Using epigraphic evidence Salomon has established that the vowel order is a e i o u, rather than the usual vowel order for Indic scripts a i u e o. This is the same as the Semitic vowel order. Also, there is no differentiation between long and short vowels in kharoshti. Both are marked using the same vowel markers
The alphabet was used by Buddhists as a mnemonic for remembering a series of verses relating to the nature of phenomena. In Tantric Buddhism this list was incorporated into ritual practices, and later became enshrined in mantras.
a | i | u | e | o | ṛ |
k | kh | g | gh | |
c | ch | j | ñ | |
ṭ | ṭh | ḍ | ḍh | ṇ |
t | th | d | dh | n |
p | ph | b | bh | m |
y | r | l | v | |
ś | ṣ | s | h |
ḱ | ṭ́h |
۱ | ۲ | ۳ | ㄨ | ۱ㄨ | ۲ㄨ | ۳ㄨ | ㄨㄨ | ۱ㄨㄨ |
---|---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
੭ | Ȝ | ੭Ȝ | ȜȜ | ੭ȜȜ | ȜȜȜ | ੭ȜȜȜ | ||
10 | 20 | 30 | 40 | 50 | 60 | 70 | ||
ʎ۱ | ʎ۲ | |||||||
100 | 200 |
Kharoṣṭhī included a set of numerals that are reminiscent of Roman numerals. The symbols were I for the unit, X for four (perhaps representative of four lines or directions), ੭ for ten (doubled for twenty), and ʎ for the hundreds multiplier. The system is based on an additive and a multiplicative principle, but does not have the subtractive feature used in the Roman number system.[1]
1 | 2 | 3 | 4 | 10 | 20 | 100 | 1000 |
Note that the table beside reads right-to-left, just like the Kharoṣṭhī abugida itself and the displayed numbers.
The numerals are encoded by Unicode at codepoints U+10A40 to U+10A47:
10A40
𐩀
One
|
10A41
𐩁
Two
|
10A42
𐩂
Three
|
10A43
𐩃
Four
|
10A44
𐩄
Ten
|
10A45
𐩅
Twenty
|
10A46
𐩆
One Hundred
|
10A47
𐩇
One Thousand
|
The Kharoṣṭhī script was deciphered by James Prinsep (1799–1840), using the bilingual coins of the Indo-Greeks (Obverse in Greek, reverse in Pāli, using the Kharoṣṭhī script). This in turn led to the reading of the Edicts of Ashoka, some of which, from the northwest of the Asian subcontinent, were written in the Kharoṣṭhī script.
Scholars are not in agreement as to whether the Kharoṣṭhī script evolved gradually, or was the deliberate work of a single inventor. An analysis of the script forms shows a clear dependency on the Aramaic alphabet but with extensive modifications to support the sounds found in Indic languages. One model is that the Aramaic script arrived with the Achaemenid conquest of the region of northwest India in 500 BCE and evolved over the next 200+ years to reach its final form by the 3rd century BCE where it appears in some of the Edicts of Ashoka found in northwestern part of the Indian.However, no intermediate forms have yet been found to confirm this evolutionary model, and rock and coin inscriptions from the 3rd century BCE onward show a unified and standard form.
The study of the Kharoṣṭhī script was recently invigorated by the discovery of the Gandharan Buddhist Texts, a set of birch-bark manuscripts written in Kharoṣṭhī, discovered near the Afghan city of Hadda just west of the Khyber Pass in modern Pakistan. The manuscripts were donated to the British Library in 1994. The entire set of manuscripts are dated to the 1st century CE, making them the oldest Buddhist manuscripts yet discovered.
|
In the early 20th century inscriptions and documents in two new related (but mutually unintelligible) languages were discovered at various sites in the Tarim Basin written in Brahmi script. It was soon found that they belonged to the Indo-European family of languages. Our only records of the now-extinct "Tokharian A" (from the region of Turfan and Karashahr), and "Tokharian B" (mainly from the region of Kucha, but also found elsewhere), are of relatively late date – 6th to 8th century CE, when written records appear; but it is likely they arrived in the region much earlier. They are now extinct, and scholars are still trying to piece together a fuller picture of these languages, their origins, history and connections, etc.[2]
Kharosthi was added to the Unicode Standard in March, 2005 with the release of version 4.1.
The Unicode block for Kharosthi is U+10A00–U+10A5F:
Kharoshthi[1] Unicode.org chart (PDF) |
||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+10A0x | 𐨀 | 𐨁 | 𐨂 | 𐨃 | 𐨅 | 𐨆 | 𐨌 | 𐨍 | 𐨎 | 𐨏 | ||||||
U+10A1x | 𐨐 | 𐨑 | 𐨒 | 𐨓 | 𐨕 | 𐨖 | 𐨗 | 𐨙 | 𐨚 | 𐨛 | 𐨜 | 𐨝 | 𐨞 | 𐨟 | ||
U+10A2x | 𐨠 | 𐨡 | 𐨢 | 𐨣 | 𐨤 | 𐨥 | 𐨦 | 𐨧 | 𐨨 | 𐨩 | 𐨪 | 𐨫 | 𐨬 | 𐨭 | 𐨮 | 𐨯 |
U+10A3x | 𐨰 | 𐨱 | 𐨲 | 𐨳 | 𐨸 | 𐨹 | 𐨺 | 𐨿 | ||||||||
U+10A4x | 𐩀 | 𐩁 | 𐩂 | 𐩃 | 𐩄 | 𐩅 | 𐩆 | 𐩇 | ||||||||
U+10A5x | 𐩐 | 𐩑 | 𐩒 | 𐩓 | 𐩔 | 𐩕 | 𐩖 | 𐩗 | 𐩘 | |||||||
Notes
|